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PTO-1590 (8-01) 



GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 

OM protein - protein search, using sw model 

Run on: March 29, 2002, 10:05:32 ; Search time 23.75 Seconds 

(without alignments) 
96.685 Million cell updates/sec 

Title: US-09-997-792-2 
Perfect score: 138 

Sequence: 1 XXEGTFTSDVSSYLXGQAAKXFIAWLVKGRX 3 1 

Scoring table: BLOSUM62 

Gapop 10.0, Gapext 0.5 

Searched: 522463 seqs, 74073290 residues 

Total number of hits satisfying chosen parameters: 253 143 

Minimum DB seq length: 0 
Maximum DB seq length: 35 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : A_Geneseq_l 101:* 

1 : /SIDS8/gcgdata/geneseq/geneseqp/AA1980.DAT:* 
2: /SID S 8/gcgdata/geneseq/geneseqp/ AA 1981. DAT : * 
3: /SIDS8/gcgdata/geneseq/geneseqp/AA1982.DAT:* 
4: /SIDS8/gcgdata/geneseq/geneseqp/AA1983.DAT:* 
5: /SIDS8/gcgdata/geneseq/geneseqp/AA1984.DAT:* 
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1 3 : /SIDS8/gcgdata/geneseq/geneseqp/AAl 992.DAT: * 
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22: /SIDS8/gcgdata/geneseq/geneseqp/AA2001 .DAT: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAR93056 

ID AAR93056 standard; peptide; 28 AA. 
XX 

AC AAR93056; 
XX 

DT 27-SEP-1996 (first entry) 
XX 

DE N-terminally truncated GLP-1 analog #12. 
XX 

KW Truncation; glucagon-like insulinotropic peptide; GLP; proglucagon; 
KW chemical synthesis; proteolysis; fragmentation; recombinant DNA; glucose; 
KW stimulation; secretion; serum uptake; diabetes mellitus; hyperglycaemia. 
XX 

OS Synthetic. 
XX 

FH Key Location/Qualifiers 
FT Modified-site 28 

FT /note= "contains C-terminal NH2 group" 

XX 

PN EP699686-A2. 
XX 

PD 06-MAR-1996. 



XX 

PF 25-AUG-1995; 95EP-0305963. 
XX 

PR 30-AUG-1994; 94US-0297731. 
XX 

PA (ELIL ) LILLY & CO ELI. 
XX 

PI Johnson WT, Yakubu-madus FE; 
XX 

DR WPI; 1996-130713/14. 
XX 

PT New C-terminal fragments of glucagon-like peptide GLP-1 - useful as 

PT hypoglycaemic agents for treating diabetes 

XX 

PS Claim 2; Page 2 1 ; 23pp; English. 
XX 

CC Peptides AAR93045-60 represent examples of novel N-terminally truncated 

CC glucagon-like insulinotropic peptide (GLP)-l peptides. The peptides can 

CC be synthesised chemically, derived by proteolytic fragmentation of 

CC proglucagon or produced by recombinant DNA technology. This peptide 

CC represents residues 9-36 of the full length GLP-1 . The novel peptides 

CC lack the insulinotropic activity of GLP-1 i.e. they do not stimulate 

CC secretion of insulin, whilst retaining an activity allowing increased 

CC serum uptake of glucose. The peptides are thus useful for treating 

CC diabetes mellitus or hyperglycaemia. 

XX 

SQ Sequence 28 AA; 



Query Match 94.9%; Score 131; DB 17; Length 28; 

Best Local Similarity 92.9%; Pred. No. l.le-14; 

Matches 26; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 3 EGTFTSD VS S YLXGQ AAKXFI AWL VKGR 30 

minium inn i 

Db 1 egtftsdvssylegqaakefiawlvkgr 28 



RESULT 14 
AAR45435 

ID AAR45435 standard; protein; 30 AA. 
XX 

AC AAR45435; 
XX 



DT 27-JUN-1994 (first entry) 
XX 

DE Insulinotropin derivative. 
XX 

KW Insulinotropic; activity; enhancing insulin activity; treatment; 

KW Type II diabetes. 

XX 

OS Synthetic. 
XX 

PN W09325579-A. 
XX 

PD 23-DEC-1993. 
XX 

PF 14-APR-1993; 93WO-US03388. 
XX 

PR 15-JUN-1992; 92US-0899073. 
XX 

PA (PFIZ ) PFIZER INC. 
XX 

PI Andrews GC, Daumy GO, Francoeur ML, Larson ER; 
XX 

DR WPI; 1994-007457/01. 
XX 

PT New derivs. of glucagon-like peptide 1 and insulinotropin - used for 
PT enhancing insulin action in a mammal, partic. by iontophoretic admin. 
XX 

PS Claim 3; Page 20; 32pp; English. 
XX 

CC The sequence is that of a derivative of insulinotropin which 
CC has insulinotropic activity and is useful for enhancing insulin 
CC action in a mammal, partic. for treating Type II diabetes 
CC (claimed). It is partic. suited for delivery to a mammal by 
CC ionophoresis. 
XX 

SQ Sequence 30 AA; 



Query Match 94.9%; Score 131; DB 15; Length 30; 

Best Local Similarity 92.9%; Pred. No. l .le-14; 

Matches 26; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 3 EGTFTSD VS S YLXGQ AAKXFI AWL VKGR 30 

minium mil iniiiiii 

Db 3 egtftsdvssylegqaakefiawlvkgr 30 




Search completed: March 29, 2002, 10:10:55 
Job time: 323 sec 



GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: March 29, 2002, 10:08:42 ; Search time 12.47 Seconds 

(without alignments) 
55.942 Million cell updates/sec 

Title: US-09-997-792-2 
Perfect score: 138 

Sequence: 1 XXEGTFTSDVSSYLXGQAAKXFIAWLVKGRX 31 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 212252 seqs, 22503292 residues 

Total number of hits satisfying chosen parameters: 139820 

Minimum DB seq length: 0 
Maximum DB seq length: 35 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: * 

/cgn2_6/ptodata/2/iaa/5A_COMB.pep:* 
/cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 
/cgn2_6/ptodata/2/iaa/6 A_COMB . pep : * 
/cgn2_6/ptodata/2/iaa/6B_COMB . pep : * 
/cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:* 
/cgn2_6/ptodata/2/iaa^ackflles 1 . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 



Result Query 
No. Score Match Length DB ID 



Description 



1 


131 


94.9 


28 


2 


131 


94.9 


28 


3 


131 


94.9 


28 


4 


131 


94.9 


29 


5 


131 


94.9 


29 


6 


131 


94.9 


29 


7 


131 


94.9 


29 


8 


131 


94.9 


29 


9 


131 


94.9 


30 


10 


131 


94.9 


30 


11 


131 


94.9 


30 


12 


131 


94.9 


30 


13 


131 


94.9 


30 


14 


131 


94.9 


30 


15 


131 


94.9 


30 


16 


131 


94.9 


30 


17 


131 


94.9 


30 


18 


131 


94.9 


30 


19 


131 


94.9 


30 


20 


131 


94.9 


30 


21 


131 


94.9 


30 


22 


131 


94.9 


31 


23 


131 


94.9 


31 


24 


131 


94.9 


31 


25 


131 


94.9 


31 


26 


131 


94.9 


31 


27 


131 


94.9 


31 


28 


131 


94.9 


31 


29 


131 


94.9 


31 


30 


131 


94.9 


31 


31 


131 


94.9 


31 


32 


131 


94.9 


31 


33 


131 


94.9 


31 


34 


131 


94.9 


31 


35 


131 


94.9 


31 


36 


131 


94.9 


31 


37 


131 


94.9 


31 


38 


128 


92.8 


28 


39 


128 


92.8 


28 


40 


128 


92.8 


31 


41 


128 


92.8 


31 


42 


128 


92.8 


31 


43 


128 


92.8 


31 


44 


128 


92.8 


31 



1 US-08-297-731-9 
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Sequence 9, Appli 
Sequence 6, Appli 

Sequence 9, Appli 
Sequence 10, Appl 
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Sequence 5, Appli 
Sequence 10, Appl 
Sequence 1 1, Appl 
Sequence 6, Appli 
Sequence 1, Appli 
Sequence 12, Appl 
Sequence 1, Appli 
Sequence 1, Appli 
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Sequence 4, Appli 
Sequence 3, Appli 
Sequence 12, Appl 
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Sequence 3, Appli 
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Sequence 3, Appli 
Sequence 1, Appli 
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Sequence 28, Appl 
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ALIGNMENTS 



RESULT 1 
US-08-297-731-9 

; Sequence 9, Application US/08297731 

; Patent No. 5574008 

; GENERAL INFORMATION: 

; APPLICANT: Johnson, William T. 

; APPLICANT: Yakubu-Madus, Fatima E. 

; TITLE OF INVENTION: BIOLOGICALLY ACTIVE FRAGMENTS OF 

; TITLE OF INVENTION: GLUCAGON-LIKE INSULINOTROPIC PEPTIDE 

; NUMBER OF SEQUENCES: 13 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Eli Lilly and Company/RSM 

; STREET: Lilly Corporate Center 

; CITY: Indianapolis 

; STATE: IN 

; COUNTRY: USA 

; ZIP: 46285 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/08/297,731 

; FILING DATE: 

; CLASSIFICATION: 514 

; ATTORNEY/AGENT INFORMATION: 

; NAME: Maciak, Ronald S. 

; REGISTRATION NUMBER: 35,262 

; REFERENCE/DOCKET NUMBER: X9630 

; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: 317-276-1664 

; TELEFAX: 317-277-1917 

; INFORMATION FOR SEQ ID NO: 9: 

; SEQUENCE CHARACTERISTICS: 

; LENGTH: 28 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 

; TOPOLOGY: linear 





; MOLECULE TYPE: peptide 
; FEATURE: 

; NAME/KEY: Modified-site 
; LOCATION: 27 . 28 

; OTHER INFORMATION: /note= "C-terminal amide" 
US-08-297-731-9 



Query Match 94.9%; Score 131; DB 1; Length 28; 

Best Local Similarity 92.9%; Pred. No. 1.8e-14; 

Matches 26; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 3 EGTFTSDVSSYLXGQAAKXFIAWLVKGR 30 




Search completed: March 29, 2002, 10:1 1:14 
Job time: 152 sec 




Search completed: March 29, 2002, 10:13:13 
Job time: 186 sec 





GenCore version 4.5 



Copyright (c) 1993 - 2000 Compugen Ltd. 
OM protein - protein search, using sw model 

Run on: March 29, 2002, 10:09:22 ; Search time 12.79 Seconds 

(without alignments) 
184.629 Million cell updates/sec 

Title: US-09-997-792-2 
Perfect score: 138 

Sequence: 1 XXEGTFT SD VS S YLXGQ AAKXFI AWL VKGRX 3 1 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext0.5 

Searched: 219241 seqs, 76174552 residues 

Total number of hits satisfying chosen parameters: 7547 

Minimum DB seq length: 0 
Maximum DB seq length: 35 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : PIR 68:* 



1: 


pirl:* 


2: 


pir2:* 


3: 


pir3 : * 


4: 


pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 



Result Query 
No. Score Match Length DB ED 



Description 



1 105 76.1 30 2 B61125 



glucagon-like pept 



2 


105 


76.1 


3 


93 


61 A 


4 


82 


59.4 


5 


80 


58.0 


6 


78 


56.5 


7 


75 


54.3 


8 


75 


54.3 


9 


75 


54.3 


10 


75 


54.3 


11 


75 


54.3 


12 


75 


54.3 


13 


73 


52.9 


14 


73 


52.9 


15 


73 


52.9 


16 


73 


52.9 


17 


70 


50.7 


18 


69 


50.0 


19 


68 


49.3 


20 


68 


49.3 


21 


68 


49.3 


22 


49.5 


35.9 


23 


46 


33.3 


24 


40 


29.0 


25 


39 


Aft ■*> 

28.3 


26 


35.5 


25.7 


27 


35 


25.4 


28 


35 


25.4 


29 


35 


25.4 


30 


35 


25.4 


31 


32.5 


23.6 


32 


32 


23.2 


33 


31 


22.5 


34 


31 


22.5 


35 


29 


21.0 


36 


29 


21.0 


37 


29 


21.0 


38 


29 


21.0 


39 


29 


21.0 


40 


29 


21.0 


41 


29 


21.0 


42 


29 


21.0 


43 


28 


20.3 


44 


28 


20.3 


45 


28 


20.3 



30 2 C61125 

30 2 S44473 
29 2 S07211 
29 1 GCDF 
29 1 GCEN 
29 1 GCOPV 
29 2 A91740 
29 2 A91741 
29 2 A91742 
29 2 C39258 

31 2 S44472 
29 1 GCDK 
29 1 A61583 
29 1 GCTTS 
31 2 S44471 
29 2 C60840 
29 1 GCCB 
29 1 GCFLE 
29 2 A61135 

29 2 S39018 
31 2 S39019 
17 2 A60317 
27 1 S07443 
27 2 A61071 

30 2 B53088 

26 1 B57082 

27 1 SEBO 
27 1 SESH 
27 2 A27267 

31 2 S74160 
27 2 T36655 
25 2 JQ0361 
27 1 SECH 
27 2 C60415 
27 4 S53133 
27 4 S53168 
27 4 S53154 
27 4 S53259 
27 4 S20754 
30 2 S07065 
35 1 HWGHD 
27 4 S53139 
27 4 S53151 
27 4 S53213 




glucagon-like pept 
glucagon-like pept 
glucagon - marbled 
glucagon - smaller 
glucagon - elephan 
glucagon - North A 
glucagon - turkey 
glucagon - rabbit 
glucagon - Arabian 
glucagon - common 
glucagon G2 - Nort . 
glucagon - duck 
glucagon - ostrich 
glucagon - slider 
glucagon Gl - Nort 
glucagon I - Europ 
glucagon - Chinchi 
glucagon - Europea 
glucagon - bigeye 
glucagon - bowfin 
glucagon-like pept 
glucagon-like pept 
secretin - human 
pituitary adenylat 
factor IX/factor X 
secretin - guinea 
secretin - bovine 
secretin - sheep 
secretin - dog 
1 8K protein - shee 
probable small mem 
vasoactive intesti 
secretin - chicken 
secretin - rabbit 
probable pre-core 
probable pre-core 
probable pre-core 
probable pre-core 
probable pre-core 
rRNA N-glycosidase 
exendin-2 - Gila m 
probable pre-core 
probable pre-core 
probable pre-core 



ALIGNMENTS 



RESULT 1 
B61125 

glucagon-like peptide - American eel 
C;Species: Anguilla rostrata (American eel) 

C;Date: 10-Mar-1994 #sequence_revision 10-Mar-1994 #text_change 21 -Nov- 1997 
C; Accession: B61125 

R;Conlon, J.M.; Andrews, PC; Thim, L.; Moon, T.W. 
Gen. Comp. Endocrinol. 82, 23-32, 1991 

A;Title: The primary structure of glucagon-like peptide but not insulin has been conserved between 

the American eel, Anguilla rostrata and the European eel, Anguilla anguilla. 

A;Reference number: A61 125; MUID:91 340068 

A;Accession: B61125 

A;Molecule type: protein 

A;Residues: 1-30<CON> 

C;Superfamily: glucagon 

C;Keywords: amidated carboxyl end; duplication 

F;l-30/Product: glucagon-like peptide #status experimental <GLP> 

F;30/Modified site: amidated carboxyl end (Arg) #status predicted 



Query Match 76.1%; Score 105; DB 2; Length 30; 

Best Local Similarity 71.4%; Pred. No. 6.3e-10; 

Matches 20; Conservative 3; Mismatches 5; Indels 0; Gaps 0; 

Qy 3 EGTFTSDVS S YLXGQ AAKXFIAWLVKGR 3 0 

Db 3 EGTYTSDVSSYLQDQAAKEFVSWLKTGR 30 

Search completed: March 29, 2002, 10:11:33 
Job time: 131 sec 



GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: March 29, 2002, 10:11:17; Search time 10.06 Seconds 

(without alignments) 
1 12.983 Million cell updates/sec 

Title: US-09-997-792-2 
Perfect score: 138 

Sequence: 1 XXEGTFTSDVSSYLXGQAAKXFIAWLVKGRX 3 1 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext0.5 

Searched: 100059 seqs, 36664827 residues 

Total number of hits satisfying chosen parameters: 2214 

Minimum DB seq length: 0 
Maximum DB seq length: 35 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_39:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 



Result Query 
No. Score Match Length DB ID 



Description 



2 
3 
4 
5 



105 

100 

82 

80 

78 



76.1 
72.5 
59.4 
58.0 
56.5 



30 1 GLUMANGAN 
33 1 GLU2_ORENI 
29 1 GLUCTORMA 
29 1 GLUC_SCYCA 
29 1 GLUC CALMI 



P41521 anguilla an 
P81027 oreochromis 
P09567 torpedo mar 
P09687 scyliorhinu 
P13189 callorhynch 



6 


75 


54.3 


29 


1 


/"^T T T/"1 r\Tr\H AT A 

GLUCDIDMA 


7 


75 


54.3 


29 


1 


/ — \ f X T T AH #T" , T 

GLUCLAMFL 


8 


75 


54.3 


29 


1 


/~"»T T T/"t ~T4 A T"\ TT' 

GLUCRABIT 


9 


73 


52.9 


29 


1 


GLUCANAPL 


10 


69 


50.0 


29 


1 


GLUCCHTBR 


11 


68 


49.3 


29 


1 


GLUCPLAFE 


12 


35 


25.4 


27 


1 


SECRCANFA 


13 


35 


25.4 


27 


1 


SECRSHEEP 


14 


31 


22.5 


25 


1 


Vn > _GADM0 


15 


31 


22.5 


27 


1 


SECRCHICK 


16 


31 


22.5 


35 


1 


T~" A/"T - ' ^ t ii it OTT 

EXE2HELSU 


17 


29 


21.0 


27 


1 


SECRRABIT 


18 


29 


21.0 


30 


1 


RIPSMOMCO 


19 


28 


20.3 


24 


1 


1 |\ O 1 T T A 1 — " T~V t 

IR3 1_HAEIN 


20 


28 


20.3 


35 


1 


T'^r 1 /~»T** A OT» 

TX1 GRASP 


21 


27 


19.6 


28 


1 


^ n n A T T Ik ATT 

VIPALLMI 


22 


27 


19.6 


28 


1 


VIPRANRI 


23 


27 


19.6 


32 


1 


Y169TREPA 


24 


27 


19.6 


34 


1 


TX1SC0GR 


25 


26 


18.8 


26 


1 


DMS5PHYBI 


26 


25 


18.1 


28 


1 


VIPSCYCA 


27 


25 


18.1 


31 


1 


CU54LOCMI 


28 


25 


18.1 


31 


1 


GP37BPSP1 


29 


25 


18.1 


35 


1 


TX2GRASP 


30 


24 


17.4 


28 


1 


t T*T OAT T^X T 

LPLSALTY 


31 


24 


1 *7 A 

17.4 


30 


1 


T» O A H K n T~K TTT T 

PSAM_PINTH 


32 


24 


1 T A 

17. 4 


33 


1 


A T" 1 f\ T% A "K TT* T T 

GAE2RANRU 


33 


24 


17 .4 


33 


1 


/^i a t~" O T» A "K TT* T T 

GAE3RANRU 


34 


24 


1 *7 A 

17.4 


35 


1 


YRKMB AC SU 


35 


23 


16.7 


19 


1 


ATT /™\T "1 — » T — ' T T 

ALL70LEEU 


36 


23 


16.7 


31 


1 


T\TT T"X^ f— x TT*T»T T 

DR7XDIPPU 


37 


23 


16.7 


33 


1 


STCONCMY 


38 


23 


16.7 


33 


1 


\7TTA7 OAT HP A 7" 

YJJXSALTY 


39 


23 


16.7 


35 


1 


END4YEREN 


40 


22 


i /* Oj 

15.9 


13 


1 


TEMLRANTE 


41 


22 


15.9 


14 


1 


MAST VESBA 


42 


22 


15.9 


28 


1 


GTS5_CHICK 


43 


22 


15.9 


31 


1 


LCCB LEUME 


44 


22 


15.9 


32 


1 


RIP2 PHYDI 


45 


22 


15.9 


33 


1 


GAST DIDMA 




PI 8 108 didelphism 
Q9prq9 lampetra fl 
P25449 oryctolagus 
P01276 anas platyr 
P31297 chinchilla 
P23062 platichthys 
P09910canisfamil 
P31299 ovis aries 
P09684 gadus morhu 
P01280 gallus gall 
P04204 heloderma s 
P32647 oryctolagus 

P20655 momordica c 
P35756 haemophilias 
P56852 grammostola 
P48 142 alligator m 
P8f016rana ridibu 
083 199 treponema p 
P56855 scodra gris 
P81487 phyllomedus 
P09685 scyliorhinu 
PI 1738 locusta mig 
048393 bacteriopha 
P56853 grammostola 
P03062 salmonella 
P41601 pinus thunb 
P80396 rana rugosa 
P80397 rana rugosa 
P54440 bacillus su 
P81430 oleaeuropa 
P82372 diploptera 
P43648 oncorhynchu 
P39432 salmonella 
P42691 yersinia en 
P57104 rana tempor 
P21654 vespa basal 
P20 137 gallus gall 

P81052 leuconostoc 
P34967 Phytolacca 
P33713 didelphism 



ALIGNMENTS 



RESULT 1 
GLUM_ANGAN 

ID GLUMANGAN STANDARD; PRT; 30 AA. 
AC P41521; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 01-NOV-1995 (Rel. 32, Last annotation update) 

DE GLUCAGON-LIKE PEPTIDE (GLP). 

OS Anguilla anguilla (European freshwater eel), and 

OS Anguilla rostrata (American eel). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Actinopterygii; Neopterygii; Teleostei; Anguilliformes; Anguilloidei; 
OC Anguillidae; Anguilla. 
OX NCBI_TaxID=7936, 7938; 
RN [1] 

RP SEQUENCE. 

RC TISSUE=Pancreas; 

RX MEDLINE=91340068;PubMed=1874385; 

RA Conlon J.M., Andrews PC, Thim L., Moon T.W.; 

RT "The primary structure of glucagon-liice peptide but not insulin has 

RT been conserved between the American eel, Anguilla rostrata and the 

RT European eel, Anguilla anguilla "; 

RL Gen. Comp. Endocrinol. 82:23-32(1991). 

CC -!- SIMILARITY: BELONGS TO THE GLUCAGON FAMILY. 

DR PIR;B61125;B61125. 

DR PIR; C61125; C61125. 

DR HSSP;P01274; 1GCN. 

DR InterPro; IPR000532; Glucagon. 

DR Pfam; PF00123; hormone2; 1. 

DR PRINTS; PR00275; GLUCAGON. 

DR SMART; SM00070; GLUCA; 1. 

DR PROSITE; PS00260; GLUCAGON; 1. 

KW Glucagon family; Amidation. 

FT MOD RES 30 30 AMIDATION. 

SQ SEQUENCE 30 AA; 3376 MW; 592DA5EABD6E49D0 CRC64; 

Query Match 76.1%; Score 105; DB 1; Length 30; 

Best Local Similarity 71.4%; Pred. No. 2.1e-10; 

Matches 20; Conservative 3; Mismatches 5; Indels 0; Gaps 0 

Qy 3 EGTFTSDVSSYLXGQAAKXFIAWLVKGR 30 

Mill llll|::|| II 
Db 3 EGTYTSDVSSYLQDQAAKEFVSWLKTGR 30 



Search completed: March 29, 2002, 10:14:20 
Job time: 183 sec 



GenCore version 4.5 
Copyright (c) 1993 - 2000 Compugen Ltd. 

OM protein - protein search, using sw model 

Run on: March 29, 2002, 10:10:57 ; Search time 21.96 Seconds 

(without alignments) 
206.486 Million cell updates/sec 

Title: US-09-997-792-2 
Perfect score: 138 

Sequence: 1 XXEGTFTSDVSSYLXGQAAKXFIAWLVKGRX 31 

Scoring table: BLOSUM62 

Gapop 10.0, Gapext0.5 

Searched: 473505 seqs, 146272329 residues 

Total number of hits satisfying chosen parameters: 1 6942 

Minimum DB seq length: 0 
Maximum DB seq length: 35 

Post-processing: Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database: SPTREMBL17:* 
1: sp_archea:* 
2: sp_bacteria:* 
3: sp fungi:* 
4: sp_human:* 
5: sp invertebrate:* 
6: sp_mammal:* 
7: sp_mhc:* 
8: sp_organelle:* 
9: sp_phage:* 
10: sp_plant:* 
11: sprodent:* 
12: sp_virus:* 
13: sp_vertebrate:* 
14: sp unclassified:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



INO. 


Score Match Length DB ED 


Description 


1 


37 


26.8 


35 2 Q9R4M5 


Q9r4m5 amycolatops 


z 


35.5 


25.7 


30 13 Q9PS05 


Q9ps05 bothropsja 


i 


35 


25.4 


28 13 Q9PRN8 


Q9prn8 carassius a 


A 

4 


33 


23.9 


33 6 Q9TSD2 


Q9tsd2 bos taurus 


C 

J 


32 


23.2 


25 4 Q9UNK3 


Q9unk3 homo sapien 


O 


32 


23.2 


26 4 Q9UQN6 


Q9uqn6 homo sapien 


7 


32 


23.2 


27 2 Q9X902 


QyxyOz streptomyce 


Q 
0 


31 


22.5 


29 12 Q86242 


Q86242 human rotav 


Q 

y 


31 


22.5 


29 12 Q86243 


(^86243 human rotav 


1U 


30 


21.7 


25 12 037187 


037187 hepatitis c 


1 1 
1 1 


30 


21.7 


25 12 037188 


Oi/188 hepatitis c 


1 o 
iz 


30 


21.7 


25 12 037189 


(J37189 hepatitis c 


1 1 


30 


21.7 


25 12 037190 


U37190 hepatitis c 


1 A 


30 


21.7 


25 12 037191 


U37191 hepatitis c 


1 c 
1 J 


30 


21.7 


25 12 037192 


03/192 hepatitis c 


10 


30 


21.7 


25 12 037193 


U37193 hepatitis c 


1 7 
1 / 


30 


21.7 


25 12 037194 


Uj / 1 y4 hepatitis c 


1 8 
1 o 


30 


21.7 


35 2 P82554 


r8Z3D4 streptococc 


1 Q 


29 


21.0 


20 2 Q9R5A6 


r» C rtkn still, i 

i^yrjao tniobacillu 


on 
zu 


29 


21.0 


25 12 Q67095 


/095 lntluenza a 


0 1 
Zl 


29 


21.0 


27 12 Q68059 


Q68059 hepatitis b 


00 
ZZ 


29 


21.0 


27 12 Q67958 


Q67958 hepatitis b 


Zj 


29 


21.0 


27 12 Q67975 


Q67975 hepatitis b 


Ozl 

Zh 


29 


21.0 


27 12 Q67988 


Q67988 hepatitis b 


0^ 

Z J 


29 


21.0 


27 12 Q81171 


i^o ii/i hepatitis b 


26 


29 


21.0 


27 12 Q9WRL1 


Q9wrll hepatitis b 


27 


29 


21.0 


27 12 Q89528 


Q89528 hepatitis b 


28 


29 


21.0 


29 6 Q9N1W3 


Q9nlw3 equus cabal 


29 


29 


21.0 


29 12 Q81170 


Q81170 hepatitis b 


30 


29 


21.0 


30 12 Q81176 


Q81176 hepatitis b 


31 


29 


21.0 


31 12 Q81188 


Q81188 hepatitis b 


32 


29 


21.0 


33 2 Q53421 


Q53421 xanthobacte 


33 


28 


20.3 


27 12 Q67945 


Q67945 hepatitis b 


34 


28 


20.3 


27 12 Q67963 


Q67963 hepatitis b 


35 


28 


20.3 


27 12 Q67972 


Q67972 hepatitis b 


36 


28 


20.3 


27 12 Q68022 


Q68022 hepatitis b 



37 
38 
39 
40 
41 
42 
43 
44 
45 



28 
28 
28 
28 
28 
28 
28 
28 
28 



20.3 
20.3 
20.3 
20.3 
20.3 
20.3 
20.3 
20.3 
20.3 



27 12 037181 
27 12 037197 

27 12 037216 

28 2 P97144 

28 4 Q16325 

29 12 Q86241 
29 12 Q86240 
29 12 Q86244 
29 12 Q86245 



037181 hepatitis c 
037197 hepatitis c 
037216 hepatitis c 
P97144 staphylococ 
Q 16325 homo sapien 



Q 8 6241 human rotav 
Q86240 human rotav 
Q86244 human rotav 
Q86245 human rotav 



ALIGNMENTS 



RESULT 1 
Q9R4M5 

ID Q9R4M5 PRELIMINARY; PRT; 35 AA. 
AC Q9R4M5; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 
DT 01-MAY-2000 (TrEMBLrel. 13, Last annotation update) 
DE PREPHENATE DEHYDRATASE (FRAGMENT). 
OS Amycolatopsis methanolica. 

OC Bacteria; Firmicutes; Actinobacteria; Actinobacteridae; 

OC Actinomycetales; Pseudonocardineae; Pseudonocardiaceae; Amycolatopsis. 

OX NCBI_TaxID=1814; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=95275250; PubMed=7755580; 
RA Euverink G.J., Wolters D.J., Dijkhuizen L.; 

RT "Prephenate dehydratase of the actinomycete Amycolatopsis methanolica: 
RT purification and characterization of wild-type and deregulated mutant 
RT proteins."; 

RL Biochem. J. 308:313-320(1995). 

SQ SEQUENCE 35 AA; 3646 MW; 3AA0FC5D5F97887F CRC64; 

Query Match 26.8%; Score 37; DB 2; Length 35; 

Best Local Similarity 42.9%; Pred. No. 59; 

Matches 9; Conservative 3; Mismatches 1; Indels 8; Gaps 1; 
Qy 4 GTFTSDVSSYLXGQAAKXFIA 24 

llll 

Db 1 0 GTFT GEAARTFMA 22 



Search completed: March 29, 2002, 10:14:04 
Job time: 1 87 sec 



